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Abstract — This paper studies the optimal investment and pricing decisions of a cognitive mobile virtual network operator (C-MVNO) 
under spectrum supply uncertainty. Compared with a traditional MVNO who often leases spectrum via long-term contracts, a C-MVNO 
can acquire spectrum dynamically in short-term by both sensing the empty "spectrum holes" of licensed bands and dynamically leasing 
from the spectrum owner. As a result, a C-MVNO can make flexible investment and pricing decisions to match the current demands 
of the secondary unlicensed users. Compared to dynamic spectrum leasing, spectrum sensing is typically cheaper, but the obtained 
useful spectrum amount is random due to primary licensed users' stochastic traffic. The C-MVNO needs to determine the optimal 
amounts of spectrum sensing and leasing by evaluating the trade off between cost and uncertainty. The C-MVNO also needs to 
determine the optimal price to sell the spectrum to the secondary unlicensed users, taking into account wireless heterogeneity of 
users such as different maximum transmission power levels and channel gains. We model and analyze the interactions between the 
C-MVNO and secondary unlicensed users as a Stackelberg game. We show several interesting properties of the network equilibrium, 
^ ■ including threshold structures of the optimal investment and pricing decisions, the independence of the optimal price on users' wireless 

characteristics, and guaranteed fair and predictable QoS among users. We prove that these properties hold for general SNR regime 
qq \ and general continuous distributions of sensing uncertainty. We show that spectrum sensing can significantly improve the C-MVNO's 

^vq ■ expected profit and users' payoffs. 

Index Terms — Cognitive radio, spectrum trading, spectrum sensing, dynamic spectrum leasing, spectrum pricing, Stackelberg game, 
Subgame Perfect equilibrium. 
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1 Introduction 

WIRELESS spectrum is typically considered as a 
scarce resource, and is traditionally allocated 
through static licensing. Field measurements show that, 
however, most spectrum bands are often under-utilized 
even in densely populated urban areas ( [2]). To achieve 
more efficient spectrum utilization, people have pro- 
posed various dynamic spectrum access approaches in- 
cluding hierarchical-access and dynamic exclusive use 
( |3|-|7J). Hierarchical-access allows a secondary (unli- 
censed) network operator or users to opportunistically 
access the spectrum without affecting the normal oper- 
ation of the spectrum owner who serves the primary 
(licensed) users. Dynamic exclusive use allows a spec- 
trum owner to dynamically transfer and trade the usage 
right of its licensed spectrum to a third party (e.g., a 
secondary network operator or a secondary end-user) in 
the spectrum market. This paper considers a secondary 
operator who obtains spectrum resource via both spec- 
trum sensing as in the hierarchical-access approach and 
dynamic spectrum leasing as in the dynamic exclusive use 
approach. 
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Spectrum sensing obtains awareness of the spectrum 
usage and existence of primary users, by using geoloca- 
tion and database, beacons, or cognitive radios (e.g., [8J- 
JTTJ) - The primary users are oblivious to the presence of 
secondary cognitive network operators or users. The sec- 
ondary network operator or users can sense and utilize 
the unused "spectrum holes" in the licensed spectrum 
without violating the usage rights of the primary users 
(e.g., (H, J7|). Since the secondary operator or users does 
not know the primary users' activities before sensing, the 
amount of useful spectrum obtained through sensing is 
uncertain (e.g. IT21-IT5I). 

With dynamic spectrum leasing, a spectrum owner 
allows secondary users to operate in their temporarily 
unused part of spectrum in exchange of economic return 
(e.g., 13, 0, HU). The dynamic spectrum leasing can be 
short-term or even real-time (e.g., fl7 | - IIT9ll ), and can be 
at a similar time scale of the spectrum sensing operation. 

In this paper, we study the operation of a cognitive 
radio network that consists a cognitive mobile virtual 
network operator (C-MVNO) and a group of secondary 
unlicensed users. The word "virtual" refers to the fact 
that the operator does not own the wireless spectrum 
bands or even the physical network infrastructure. The 
C-MVNO serves as the interface between the spectrum 
owner and the secondary end-users. The word "cog- 
nitive" refers to the fact that the operator can obtain 
spectrum resource through both spectrum sensing using 
the cognitive radio technology and dynamic spectrum 
leasing from the spectrum owner. The operator then 
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resells the obtained spectrum (bandwidth) to secondary 
users to maximize its profit. The proposed model is a 
hybrid of the hierarchical-access and dynamic exclusive 
use models. It is applicable in various network sce- 
narios, such as achieving efficient utilization of the TV 
spectrum in IEEE 802.22 standard |20| . This standard 
suggests that the secondary system should operate on a 
point-to-multipoint basis, i.e., the communications will 
happen between secondary base stations and secondary 
customer-premises equipment. The base stations can be 
operated by one or several C-MVNOs introduced in this 
paper. 

Compared with a traditional MVNO who only leases 
spectrum through long-term contracts, a C-MVNO can 
dynamically adjust its sensing and leasing decisions to 
match the changes of users' demand at a short time scale. 
Moreover, sensing often offers a cheaper way to obtain 
spectrum compared with leasing. The cost of sensing 
mainly includes the sensing time and energy, and does 
not include explicit cost paid to the spectrum owner. 
With a mature spectrum sensing technology, sensing cost 
should be reasonable low (otherwise there is no point 
of using cognitive radio). Spectrum leasing, however, 
involves direct negotiation with the spectrum owner. 
When the spectrum owner determines the cost of leasing, 
it needs to calculate its opportunity cost, i.e., how much 
revenue the spectrum can provide if the spectrum owner 
provides services directly over it. It is reasonable to 
believe that the leasing cost is more expensive than the 
sensing cost in most cases^. Although sensing is cheaper, 
the amount of spectrum obtained through sensing is 
often uncertain due to the stochastic nature of primary 
users' traffic. It is thus critical for a C-MVNO to find the 
right balance between cost and uncertainty. 

Our key results and contributions are summarized 
as follows. For simplicity, we refer to the C-MVNO as 
"operator", secondary users as "users", and "dynamic 
leasing" as "leasing". 

• A Stackelberg game model: We model and analyze 
the interactions between the operator and the users 
in the spectrum market as a Stackelberg game. As 
the leader, the operator makes the sensing, leasing, 
and pricing decisions sequentially. As the followers, 
users then purchase bandwidth from the operator 
to maximize their payoffs. By using backward in- 
duction, we prove the existence and uniqueness 
of the equilibrium, and show how various system 
parameters (i.e., sensing and leasing costs, users' 
transmission power and channel conditions) affect 
the equilibrium behavior. 

• Threshold structures of the optimal investment and 
pricing decisions: At the equilibrium, the operator 
will sense the spectrum only if the sensing cost 
is cheaper than a threshold. Furthermore, it will 
lease some spectrum only if the resource obtained 

1. The analysis of this paper also covers the case where sensing is 
more expensive than leasing, which is a trivial case to study. 



through sensing is below a threshold. Finally, the 
operator will charge a constant price to the users if 
the total bandwidth obtained through sensing and 
leasing does not exceed a threshold. The thresholds 
are easy to compute and the corresponding deci- 
sions rules are easy to implement in practice. 

• Fair and predictable QoS: The operator's optimal pric- 
ing decision is independent of the users' wireless 
characteristics. Each user receives a payoff that is 
proportional to its channel gain and transmission 
power, which leads to the same signal-to-noise 
(SNR) for all users. 

• Impact of spectrum sensing: We show that the avail- 
ability of sensing always increases the operator's 
profit in the expected sense. The actual realization 
of the profit at a particular time heavily depends 
on the spectrum sensing results. Users always get 
better payoffs when the operator performs spectrum 
sensing. 

Section [2] introduces the network model and prob- 
lem formulation. In Section [3j we analyze the game 
model through backward induction. We discuss various 
insights obtained from the equilibrium analysis and 
present some numerical results in Section [4] In Section 
[5j we show the impact of spectrum sensing on both the 
operator and users. We conclude in Section [6] and outline 
some future research directions. 

1.1 Related Work 

There is a growing interest in studying the investment 
and pricing decisions of cognitive network operators 
recently. Several auction mechanisms have been pro- 
posed to study the investment problems of cognitive 
network operators (e.g., [2"T|. |22]). Other recent results 
studied the pricing decisions of the cognitive network 
operators who interact with a group of secondary users 
(e.g., [23J-[3Ql). El considered users' queueing delays 
and obtained most results through simulations. [23J pre- 
sented a recent survey on the spectrum sharing games 
of network operators and cognitive radio networks. 
|24| studied the competition among multiple service 
providers without modeling users' wireless details. Il25l 
considered a pricing competition game of two operators 
and adopted a simplified wireless model for the users. 
l26l derived users' demand functions based on the ac- 
ceptance probability model for the users. E71 explored 
demand functions based on both quality-sensitive and 
price-sensitive buyer population models. |28| formulated 
the interaction between one primary user (monopolist) 
and multiple secondary users as a Stackelberg game. 
The primary user uses some secondary users as re- 
lays and leases its bandwidth to those relays to collect 
revenue. [29] studied a multiple-level spectrum market 
among primary, secondary, and tertiary services where 
global information is not available. l30l considered the 
short-term spectrum trading between multiple primary 
users and multiple secondary users. The spectrum buy- 
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Fig. 1. Operator's Investment in Spectrum Sensing and 
Leasing 



ing behaviors of secondary users are modeled as an 
evolutionary game, while selling behaviors of primary 
users are modeled as a noncooperative game. 11261 - 130) 
obtained most interesting results through simulations. 
There are only few papers (e.g., QU, (29), J3TJ) that 
jointly considered the spectrum investment and service 
pricing problem as this paper. None of the above work 
considered the impact of supply uncertainty due to 
spectrum sensing. 

Our model of spectrum uncertainty is related to the 
random-yield model in supply chain management (e.g., 
Il32l -f34l). The unique wireless aspects of the system 
model lead to new solutions and insights in our problem. 

Our paper represents a first attempt of understanding 
how spectrum uncertainty impacts the economic deci- 
sions of an cognitive radio operator. To obtain sharp 
insights, we focus on a stylized model where a monopo- 
list operator faces a group of secondary users. There are 
many more interesting research issues in this area. Some 
are further discussed in Section [6] 



2 Network Model 

2.1 Background on Spectrum Sensing and Leasing 

To illustrate the opportunity and trade-off of spectrum 
sensing and leasing, we consider a spectrum owner who 
divides its licensed spectrum into two types: 

• Service Band: This band is reserved for serving the 
spectrum owner's primary users (PUs). Since the 
PUs' traffic is stochastic, there will be some unused 
spectrum which changes dynamically. The operator 
can sense and utilize the unused portions. There are 
no explicit communications between the spectrum 
owner and the operator. 

• Transference Band: The spectrum owner temporarily 
does not use this band. The operator can lease the 
bandwidth through explicit communications with 
the spectrum owner. No sensing is allowed in this 
band. 

Due to the short-term property of both sensing and 
leasing, the operator needs to make both the sensing and 
leasing decisions in each time slot. 

The example in Fig. 1 demonstrates the dynamic 
opportunities for spectrum sensing, the uncertainty of 
sensing outcome, and the impact of sensing or leasing 



TABLE 1 
Key Notations 



Symbol 


Physical Meaning 


B s 


Sensing bandwidth 


Bi 


Leasing bandwidth 


C s 


Unit sensing cost 


Ci 


Unit leasing cost 


a G [0, 1J 


Sensing realization factor 




Set of secondary users 


7T 


Unit price 


Wi 


User i's bandwidth allocation 




User i's data rate 


nmax 
»■ 


User i's maximum transmission power 


hi 


User i's channel gain 


no 


Noise power density 


9i = P^hi/no 


User i's wireless characteristic 


SNR; = gt/wi 


User i's SNR 


G = 12iGX9i 


Users' aggregate wireless characteristics 


R 


Operator's profit 



decisions. The spectrum owner's entire band is divided 
into small 34 channel^ 

« Time slot 1: PUs use channels 1—4 and 11 — 15. The 
operator is unaware of this and senses channels 3—8. 
As a result, it obtains 4 unused channels (5 — 8). 
It leases additional 9 channels (20 — 28) from the 
transference band. 

« Time slot 2: PUs change their behavior and use 
channels 1—6. The operator senses channels 5 — 14 
and obtains 8 unused channels (7 — 14). It leases 
additional 5 channels (23 — 27) from the transference 
band. 

In this paper, we will only study the operator's deci- 
sions within a single time slot. We choose the time slot 
length such that primary users' activities remain roughly 
unchanged within a single time slot. This means that it 
is enough for the operator to sense at the beginning of 
each time slot. For traffic types such as TV programs, 
data transfer, and even VoIP voice sessions, the length 
of the time slot can be reasonable long. For readers who 
are interested in the optimization of the time slot length 
to balance sensing and data transmission, see lfl4l . 

2.2 Notations and Assumptions 

We consider a cognitive network with one operator and 
a set X = {1,...,/} of users. The operator has the 
cognitive capability and can sense the unused spectrum. 
One way to realize this is to let the operator construct 
a sensor network that is dedicated to sensing the radio 
environment in space and time l35ll . The operator will 
collect the sensing information from the sensor network 
and provide it to the unlicensed users, or providing 
"sensing as service". If the operator owns several base 
stations, then each base station is responsible for collect- 
ing sensing information in a certain geographical area. 
As mentioned in l35ll , there has been significant current 

2. Channel 16 is the guard band between the service and transference 
bands. 
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Stage I: Operator determines sensing amount B s 

realize available 
, , bandwidth B s a 

Stage II: Operator determines leasing amount B l 



v 

Stage III: Operator announces price n to market 



y 

Stage IV: End-users determine the demands for 
bandwidth from the operator 



Fig. 2. A Stackelberg Game 



research efforts in the context of an European project 
SENDORA (36l , which aims at developing techniques 
based on sensor networks for supporting coexistence of 
licensed and unlicensed wireless users in a same area. 
The users are equipped with software defined radios and 
can tune to transmit in a wide range of frequencies as 
instructed by the operator, but do not necessarily have 
the cognitive sensing capacity. Since the secondary users 
do not worry about sensing, they can spend most of 
their time and energy on actual data transmissions. Such 
a network structure puts most of the implementation 
complexity at the operator side and reduces the user 
equipment complexity, and thus might be easier to im- 
plement in practice than a "full" cognitive network. 

The key notations of this paper are listed in Table [I] 
with some explanations as follows. 

• Investment decisions B s and Bv. the operator's sensing 
and leasing bandwidths, respectively. 

• Sensing realization factor a: when the operator senses 
a total bandwidth of B s , only a proportion of a £ 
[0, 1] is unused and can be used by the operator, a 
is a random variable and depends on the primary 
users' activities. With perfect sensing results, users 
can use bandwidth up to B s a without generating 
interferences to the primary users. 

• Cost parameters C s and Cf. the operator's fixed sens- 
ing and leasing costs per unit bandwidth, respec- 
tively. Sensing cost C s depends on the operator's 
sensing technologies. When the operator senses 
spectrum, it needs to spend time and energy on 
channel sampling and signal processing ( (371). Sens- 
ing over different channels often needs to be done 
sequentially due to the potentially large number of 
channels open to opportunistic spectrum access and 
the limited power /hardware capacity of cognitive 
radios ( |38|). The larger sensing bandwidth and the 
more channels, the longer time and higher energy it 
requires ( 13910 . For simplicity, we assume that total 
sensing cost is linear in the sensing bandwidth B s . 
Leasing cost C\ is determined through the negotia- 
tion between the operator and the spectrum owner 
and is assumed to be larger than C s . 

• Pricing decision ir: the operator's choice of price per 
unit bandwidth to the users. 



2.3 A Stackelberg Game 

We consider a Stackelberg Game between the operator 
and the users as shown in Fig. |2] The operator is the 
Stackelberg leader: it first decides the sensing amount 
B s in Stage I, then decides the leasing amount Bi in 
Stage II (based on the sensing result B s a), and then 
announces the price ir to the users in Stage III (based 
on the total supply B s a + Bi). Finally, the users choose 
their bandwidth demands to maximize their individual 
payoffs in Stage IV. 

We note that "sensing followed by leasing" is optimal 
for the operator to maximize profit. Since sensing is 
cheaper than leasing, the operator should lease only 
if sensing does not provide enough resource. If the 
operator determines sensing and leasing simultaneously, 
then it is likely to "over-lease" (compared with "sensing 
followed by leasing") to avoid having too little resource 
when a is small. "Leasing before sensing" can not 
improve the operator's profit either due to the same 
reason. 

3 Backward Induction of the Four- 
stage Game 

The Stackelberg game falls into the class of dynamic 
game, and the common solution concept is the Subgame 
Perfect Equilibrium (SPE, or simply as equilibrium in this 
paper). A general technique for determining the SPE is 
the backward induction ( [40]). We will start with Stage 
IV and analyze the users' behaviors given the operator's 
investment and pricing decisions. Then we will look at 
Stage III and analyze how the operator makes the pricing 
decision given investment decisions and the possible 
reactions of the users in Stage IV. Finally we proceed to 
derive the operator's optimal leasing decision in Stage 
II and then the optimal sensing decision in Stage I. The 
backward induction captures the sequential dependence 
of the decisions in four stages. 

3.1 Spectrum Allocation in Stage IV 

In Stage IV, end-users determine their bandwidth de- 
mands given the unit price ir announced by the operator 
in stage III. 

Each user can represent a transmitter-receiver node 
pair in an ad hoc network, or a node that transmits to the 
operator's base station in an uplink scenario. We assume 
that users access the spectrum provided by the opera- 
tor through FDM (Frequency-division multiplexing) or 
OFDM (Orthogonal frequency-division multiplexing) to 
avoid mutual interferences. User i's achievable rate (in 
nats) i^§ 

/ pmaxL.X 

r i (w i )=w i ]n 1 + ^ *-), (1) 

3. We assume that the operator only provides bandwidth without 
restricting the application types. This assumption has been commonly 
used in dynamic spectrum sharing literature, e.g., 1161 , 1191 . |24[, |41[. 
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where Wi is the allocated bandwidth from the operator, 
pmax j s user j' s maximum transmission power, uq is the 
noise power per unit bandwidth, hi is user i's channel 
gain (between user i's own transmitter and receiver in 
an ad hoc network, or between user i's transmitter to the 
operator's base station in an uplink scenario). To obtain 
rate in (TTJ, user i spreads its maximum transmission 
power p™ ax across the entire allocated bandwidth w,;. 
To simplify the notation, we let gi = i^ max /ij/no, thus 
gi/wi is the user i's signal-to-noise ratio (SNR). Here 
we focus on best-effort users who are interested in 
maximizing their data rates. Each user only knows its 
local information (i.e., p* aax l hi, and no) and does not 
know anything about other users. 

From a user's point of view, it does not matter 
whether the bandwidth has been obtained by the opera- 
tor through spectrum sensing or dynamic leasing. Each 
unit of allocated bandwidth is perfectly reliable for the 
user. 

To obtain closed-form solutions, we first focus on the 
high SNR regime where SNR ^> 1. This is motivated by the 
fact that users often have limited choices of modulation 
and coding schemes, and thus may not be able to decode 
a transmission if the SNR is below a threshold. In the 
high SNR regime, the rate in {l} can be approximated as 

n(wi) = uiihx f — ) . (2) 

\WiJ 

Although the analytical solutions in Section|3]are derived 
based on (0, we emphasize that all the major engineering 
insights remain true in the general SNR regime. A formal 
proof is in Section [4] 

A user i's payoff is a function of the allocated band- 
width Wi and the price tt, 

Ui(TT,Wi) = Willi I — ) —TVWi, (3) 
\WiJ 

i.e., the difference between the data rate and the linear 
payment (irwi). Payoff Ui(n, wi) is concave in w%, and the 
unique bandwidth demand that maximizes the payoff is 

w*(ir) = argmaxui(7r, wA = gje~ (1+7r ', (4) 

which is always positive, linear in gi, and decreasing 
in price tt. Since gi is linear in channel gain hi and 
transmission power P j ? nax / then a user with a better 
channel condition or a larger transmission power has 
a larger demand. 

Equation ((U shows that each user i achieves the same 
SNR: 

SNR; = -JL- = e ^\ 
w*(tt) 

but a different payoff that is linear in gi, 

Ui(7r,<00) = <?*e- (1+7r) . 
We denote users' aggregate wireless characteristics as 
G = Yliei9i- The users' total demand is 

^<(7r) = Ge-( 1+ ^. (5) 




Fig. 3. Different intersection cases of D(tt) and S(tt) 

Next, we will consider how the operator makes the 
investment (sensing and leasing) and pricing decisions 
in Stages I-III based on the total demand in eq. In 
particular, we will show that the operator will always 
choose a price in Stage III such that the total demand 
(as a function of price) does not exceed the total supply. 

3.2 Optimal Pricing Strategy in Stage III 

In Stage III, the operator determines the optimal pricing 
considering users' total demand ©, given the band- 
width supply B s a + Bi obtained in Stage II. The operator 
profit is 

R(B S , a, Bt,n) = min ^ w* (tt), tt (Bi + B s a)^ 

- (B S C S + Bid) , (6) 

which is the difference between the revenue and total 
cost. The min operation denotes the fact that the operator 
can only satisfy the demand up to its available supply. 
The objective of Stage III is to find the optimal price 
7r* (B s , a, Bi) that maximizes the profit, that is, 

R in (B a) a,Bi) = maxR(B s ,a,B l ,Tr). (7) 

7T>0 

The subscript "III" denotes the best profit in Stage III. 

Since the bandwidths B s and B\ are given in this 
stage, the total cost B S C S + B[Ci is already fixed. The 
only optimization is to choose the optimal price tt to 
maximize the revenue, i.e., 

max min I tt w*(tt), tt (Bi + B s a) ) . (8) 
7r ^° \ iei / 

The solution of problem l(8| depends on the bandwidth 
investment in Stages I and II. Let us define D (tt) = 
T'J2i£i w i( 7r ) an d S(tt) = n(Bi + B s a). Figure |3] shows 
three possible relationships between these two terms, 
where Sj(ir) (for j = 1, 2, 3) represents each of the three 
possible choices of S(tt) depending on the bandwidth 
B t + B s a: 

• Si(n) (excessive supply): No intersection with D(tt); 

4. We assume that the operator knows the value of G through proper 
feedback mechanism from the users. 
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TABLE 2 

Optimal Pricing Decision and Profit in Stage III 



Total Bandwidth Obtained in 
Stages I and II 


Optimal Price it* (B s ,a, _B ( ) 


Optimal Profit R n i{B s ,a, B t ) 


Excessive Supply Regime: 

B t + B a a > Ge~ 2 


TT^' = 1 


Rf^Bs^a^Bt) = Ge~ 2 - B S C S - B l C l 


Conservative Supply Regime: 

B t + B s a < Ge -6 




RCfjlBs.^Bt) = (B, + B s a)ln(_SL_) - 
B s (a + C a ) - B t (l + Ci) 



• ^(tt) (excessive supply): intersect once with D(tt) 
where D(tt) has a non-negative slope; 

• S3(n) (conservative supply): intersect once with 
D(ir) where D(ir) has a negative slope. 

In the excessive supply regime, 

max 7r >o min (S(ir), D(ir)) = max^x) D(n), i.e., the 
max-min solution occurs at the maximum value of D(ir) 
with 7r* = 1. In this regime, the total supply is larger 
than the total demand at the best price choice. In the 
conservative supply regime, the max-min solution occurs 
at the unique intersection point of D(ir) and S(ir). The 
above observations lead to the following result. 

Theorem 1: The optimal pricing decision and the corre- 
sponding optimal profit at Stage III can be characterized 
by Table 12 

The proof of Theorem [TJ is given in Appendix [A] Note 
that in the excessive supply regime, some bandwidth 
is left unsold (i.e., S(tt*) > D(ir*)). This is because 
the acquired bandwidth is too large, and selling all the 
bandwidth will lead to a very low price that decreases 
the revenue (the product of price and sold bandwidth). 
The profit can be apparently improved if the operator 
acquires less bandwidth in Stages I and II. Later analysis 
in Stages II and I will show that the equilibrium of the 
game must lie in the conservative supply regime if the 
sensing cost is non-negligible. 

3.3 Optimal Leasing Strategy in Stage II 

In Stage II, the operator decides the optimal leasing 
amount Bi given the sensing result B s a: 



Rn(B s ,a) = ma,xR ln (B s ,a,Bi). 

B t >0 



(9) 



We decompose problem ((9) into two subproblems based 
on the two supply regimes in Table 12 

1) Choose Bi to reach the excessive supply regime in 
Stage III: 

Rff (B s , a) = max Rffj (B s ,a,Bi). 

Bi>max{Ge~ 2 —B s ot,0} 

(10) 

2) Choose Bi to reach the conservative supply regime 
in Stage III: 



RYf(B s ,a)= max i?^(B s , a, £,), (11) 

0<B l <Ge- 2 -B s a 

To solve subproblems (flOb and ifTT) , we need to con- 
sider the bandwidth obtained from sensing. 

• Excessive Supply (B s a > Ge~ 2 ): in this case, the 
feasible sets of both subproblems JTOb and iQj} are 



empty. In fact, the bandwidth supply is already in 
the excessive supply regime as defined in Table II, 
and it is optimal not to lease in Stage II. 
> Conservative Supply (B s a < Ge~ 2 ): first, we can show 
that the unique optimal solution of subproblem ((lOt 
is £> ( * = Ger 2 — B s a. This means that the optimal 
objective value of subproblem ((TO) is no larger than 
that of subproblem ((TT) . and thus it is enough to 
consider subproblem ((TT) in the conservative supply 
regime only. 

Base on the above observations and some further 
analysis, we can show the following: 

Theorem 2: In Stage II, the optimal leasing decision 
and the corresponding optimal profit are summarized 
in Table |3 

The proof of Theorem |2] is given in Appendix |B] Table 
[3] contains three cases based on the value of B s a: (CS1), 
(CS2), and (ES3). The first two cases involve solving 
the subproblem i fTTll in the conservative supply regime, 
and the last one corresponds to the excessive supply 
regime. Although the decisions in cases (CS2) and (ES3) 
are the same (i.e., zero leasing amount), we still treat 
them separately since the profit expressions are different. 

It is clear that we have an optimal threshold leasing 
policy here: the operator wants to achieve a total band- 
width equal to Ge~^ 2+Cl ^ whenever possible. When the 
bandwidth obtained through sensing is not enough, the 
operator will lease additional bandwidth to reach the 
threshold; otherwise the operator will not lease. 

3.4 Optimal Sensing Strategy in Stage I 

In Stage I, the operator will decide the optimal sensing 
amount to maximize its expected profit by taking the un- 
certainty of the sensing realization factor a into account. 
The operator needs to solve the following problem 

Rj = max Rj j (B s ) , 

B a >o 

where Rn (B s ) is obtained by taking the expectation of 
a over the profit functions in Stage II (i.e., Rfj S1 (B s ,a), 
Rff 2 (B s ,a), and Rff 3 (B s ,a) in Table©. 

To obtain closed-form solutions, we assume that the 
sensing realization factor a follows a uniform distri- 
bution in [0,1]. In Section [4.11 we prove that the major 
engineering insights also hold under any general distribution. 

To avoid the trivial case where sensing is so cheap 
that it is optimal to sense a huge amount of bandwidth, 



7 



TABLE 3 

Optimal Leasing Decision and Profit in Stage II 



Given Sensing Result B s a After Stage I 


Optimal Leasing Amount B* 


Optimal Profit R n (B s ,a) 


(CS1) B s a < Ge-( 2+c '') 




RvpiB^a) = Ge-V+ U i) +B s (<xCi-C s ) 


(CS2) B s a G (Ge-( 2 + c i),Ge- 2 ] 


B CS2 = 


Rff 2 (B s ,a) = B s aln(-^) -B s (a + C s ) 


(ES3) B s a > Ge- A 


Bf a3 = 


R??*(B s ,a) = Ge-- 2 - B S C S 



0.02 



0.015 



0.01 



0.005 




0.01 0.02 0.03 0.04 0.05 0.06 



Fig. 4. Expected profit in Stage II under different sensing 
and leasing costs 



we further assume that the sensing cost is non-negligible 



and is lower bounded by C s > (1 



-2C, 



)/4- 



To derive function Rn (B s ), we will consider the 
following three intervals: 

1) Case I: B s £ [0, Ge~^ 2+Cl ']. In this case, we always 
have B s a < Ge~ (2+C ^ for any value a e [0, 1], 
which corresponds to case (CS1) in Table [3] The 
expected profit is 



Rii(B s ) =Eae[o,i] [R?i S1 {Bs,a) 



C s 



which is a linear function of B s . If C s > Ci/2, 
R}j(B s ) is linearly decreasing in B s ; if C s < Ci/2, 
R\ I {B S ) is linearly increasing in B s . 
2) Case II: B s e (Ge- (2+Ci \ Ge~ 2 ] . Depending on the 
value of a, B s a can be in either case (CS1) or case 
(CS2) in Table |3] The expected profit is 



R 2 H (B S ) -Er Ss -y+c A - 



[R?fHB s ,a)] 



E 



-(2 + C,) 



R 



CS2 
II 



(B s ,a) 



Bs 
' 2 





B s 


4 


/Ge-(2+c,)\ 


(I) 


4 







The expected profit is 

r ii( b s) = E ae ^ ce-wq) j [Rii 1 { B s,a)] 

[R?f 2 (Bs,*)] 



? ES3/ 



G\ 2 e- 2C 



—— B S C S H — 

4EL e 2 



Because its first-order derivative 

dR? u {B s ) (Ge- 2 \ 2 1 



HI 
dBs 



-2d 



Cs < o, 



as B s > Ge 2 in this case, R 3 jj(Bs) is decreasing in 
B s and achieves its maximum at B s = Ge~ 2 . 

To summarize, the operator needs to maximize 

'Rh(B s ) 



Rn(B s 



R 2 tj{B s ), 
{B s ), 



R 3 



if < B s < Ge-( 2+c ''; 
if Ge-( 2+c <> < B s < Ge~ 2 ; 
if B x > Ge~ 2 . 



(12) 

We can verify that Case II always achieves a higher 
optimal profit than Case III. This means that the optimal 
sensing will only lead to either case (CS1) or case (CS2) 
in Stage II, which corresponds to the conservative supply 
regime in Stage III. This confirms our previous intuition 
that equilibrium is always in the conservative supply 
regime under a non-negligible sensing cost, since some 
resource is wasted in the excessive supply regime (see 
discussions in Section [3T2J. 

Table S] shows that the sensing decision is made in the 
following two cost regimes: 

• High sensing cost regime (C s > Ci/2): it is optimal 
not to sense. Intuitively, the coefficient 1/2 is due 
to the uniform distribution assumption of a, i.e., on 
average obtaining one unit of available bandwidth 
through sensing costs 2C S . 



- B s Cs 



R 2 j(B s ) is a strictly concave function of B s since 
its second-order derivative 



Low sensing cost regime (C s € — % — , ^ ): the 
optimal sensing amount B^* is the unique solution 
to the following equation: 



d 2 Rj I (B s 
dB 2 



1 

~2B, 



Ge~( 2 + C 'i 



< o 



dRjj(B, 



dB s 2 ln 



B s /G 



-(2+CY 



2B S /G 



as B s > Ge _ ( 2+C!) in this case. 
3) Case DDL: B s e (Ge~ 2 ,oo). Depending on the value 
of a, B s a can be any of the three cases in Table |3] 



= 0. 
(13) 



The uniqueness of the solution is due to the strict 
concavity of R 2 II (B S ) over B s - We can further show 
that Bf* lies in the interval of [Ge~ ( - 2+Cl \ Ge' 2 ] 
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TABLE 4 

Choice of Optimal Sensing Amount in Stage I 





Optimal Sensing Decision B* 


Expected Profit Rj 


High Sensing Cost Regime: C s > C;/2 


B* s =0 


Rf =Ge -(2+c ! ) 


Low Sensing Cost Regime: C s 6 (1 - e-' 2U ')/A, CJ2\ 


B* = Bjf", solution to eq. (13) 


R'f in eq. (14) 



and is linear in G. Finally, the operator's optimal 
expected profit is 

(14) 

Based on these observations, we can show the follow- 
ing: 

Theorem 3: In Stage I, the optimal sensing decision 
and the corresponding optimal profit are summarized 
in Table |U The optimal sensing amount B* is linear in 
G. 

Figure |4] shows two possible cases for the function 
Rn(B s ). The vertical dashed line represents B s = 
g -(2+C() p or illustration purpose, we assume G = 1, 
Ci = 2, and C s = {0.8,1.2}. When the sensing cost 
is large (i.e., C s = 1.2 > C//2), R n (B s ) achieves its 
optimum at B s = and thus it is optimal not to sense. 
When the sensing cost is small (i.e., C s = 0.8 < C//2), 
Ru(B s ) achieves its optimum at B s > e~( 2+c ^ and it is 
optimal to sense a positive amount of spectrum. 

4 Equilibrium Summary and Numerical 
Results 

Based on the discussions in Section |3j we summarize 
the operator's equilibrium sensing /leasing/ pricing de- 
cisions and the equilibrium resource allocations to the 
users in Table [5] Several interesting observations are as 
follows. 

Observation 1: Both the optimal sensing amount B* 
(either or B^*) and leasing amount Bf are linear 
in the users' aggregate wireless characteristics G = 

The linearity enables us to normalize optimal sensing 
and leasing decisions by users' aggregate wireless char- 
acteristics, and study the relationships between the nor- 
malized optimal decisions and other system parameters 
as in Figs. |5] and 

Figure [5] shows how the normalized optimal sensing 
decision B*/G changes with the costs. For a given leas- 
ing cost Ci, the optimal sensing decision B* decreases as 
the sensing cost C s becomes more expensive, and drops 
to zero when C s > Ci/2. For a given sensing cost C s , 
the optimal sensing decision B* increases as the leasing 
cost Ci becomes more expensive, in which case sensing 
becomes more attractive. 

Figure [6] shows how the normalized optimal leasing 
decision B\ /G depends on the costs Gi and C s as well 
as the sensing realization factor a in the low sensing 
cost regime (denoted by "L"). In all cases, a higher 



value a means more bandwidth is obtained from sensing 
and there is a less need to lease. Figure [6] confirms 
the threshold structure of the optimal leasing decisions 
in Section 13.31 i.e., no leasing is needed whenever the 
bandwidth obtained from sensing reaches a threshold. 
Comparing different curves, we can see that the operator 
chooses to lease more as leasing becomes cheaper or 
sensing becomes more expensive. For high sensing cost 
regime, the optimal leasing amount only depends on Ci 
and is independent of C s and a, and thus is not shown 
here. 

Observation 2: The optimal pricing decision n* in 
Stage III is independent of users' aggregate wireless 
characteristics G. 

Observation [2] is closely related to Observation [1] Since 
the total bandwidth is linear in G, the "average" resource 
allocation per user is "constant" at the equilibrium. This 
implies that the price must be independent of the user 
population change, otherwise the resource allocation to 
each individual user will change with the price accord- 
ingly. 

Observation 3: The optimal pricing decision tt* in 
Stage III is non-increasing in a in the low sensing cost 
regime. 

First, in the low sensing cost regime where the sensing 
result is poor (i.e., a is small as the third column in Table 
0, the operator will lease additional resource such that 
the total bandwidth reaches the threshold Ge~( 2+Cl \ In 
this case, the price is a constant and is independent of 
the value of a. Second, when the sensing result is good 
(i.e., a is large as in the last column in Table 0, the total 
bandwidth is large enough. In this case, as a increases, 
the amount of total bandwidth increases, and the optimal 
price decreases to maximize the profit. 

Figure shows how the optimal price changes with 
various costs and a in the low sensing cost regime. It 
is clear that price is first a constant and then starts to 
decrease when a is larger than a threshold. The threshold 
decreases in the optimal sensing decision of B^*: a 
smaller sensing cost or a higher leasing cost will lead 
to a higher B^* and thus a smaller threshold. 

It is interesting to notice that the equilibrium price 
only changes in a time slot where the sensing realization 
factor a is large. This means that although operator has 
the freedom to change the price in every time slot, the 
actual variation of price is much less frequent. This leads 
to less overhead and makes it easier to implement in 
practice. Figure [5] illustrates this with different sensing 
costs and a realizations. The left two subfigures corre- 
spond to the realizations of a and the corresponding 
prices with C s ~ 0.48 and G\ = 1. As the sensing 
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TABLE 5 

The Operator's and Users' Equilibrium Behaviors 



JCl LC7-LL IZL V^V-J Z> L 1\C t~±LL LCD 


T— Ti c^Vi Spnsinc^ C^nst' (~) ^> 


Low Sensing Cost: 1 ~ e / U ' < C s < ° Q l 


Optimal Sensing Amount B* 





Bf 1 * 6 [Ge-( 2 + c i),Ge- 2 ], solution to eq. (13) 


Sensing Realization Factor a 


< a < 1 


< a < Gc-l 2 + c i)/-Bf* 


a > Ge-( 2 + c 'i)/B s L * 


Optimal Leasing Amount B* 








Optimal Pricing -rr* 


1 + C t 


i + c. 




Expected Profit Rj 




Rf in eq. Q4) 


J?f in eq. QD 


User i's SNR 


e (2+c",) 






User ;'s Payoff 


g t e-Wi> 




g^B^a/G) 




0.3 ~ 0.4 
Sensing Cost C 



0.2 0.4 0.6 O.B 
Sensing Reliazation Factor a 



0.2 0.4 0.6 u 
Sensing Reliazation Factor a 



Fig. 5. Optimal sensing amount B* Fig. 6. Optimal leasing amount B* as Fig. 7. Optimal price n* as a function 
as a function of C s and Q. a function of C s , Q, and a. 



of C s , Cu and a. 




10 20 3D 40 50 10 20 30 40 50 



Time Slots t with C s =0.48 and C = 1 Time Slots t 2 with C s =0.35 and C = f 

Fig. 8. Optimal price it* over time with different sensing 
costs and a realizations 



cost C s is quite high in this case, the operator does not 
rely heavily on sensing. As a result, the variability of 
a (in the upper subfigure) has very small impact on 
the equilibrium price (in the lower subfigure). In fact, 
the price only changes in 11 out 50 time slots, and the 
maximum amplitude variation is around 10%. The right 
two figures correspond to the case where C s = 0.35 
and C[ = 1. As sensing cost is cheaper in this case, the 
operator senses more and the impact of a on price is 



larger. The price changes in 30 out of 50 time slots, and 
the variation in amplitude can be as large as 30%. 

Observation 4: The operator will sense the spectrum 
only if the sensing cost is lower than a threshold. Fur- 
thermore, it will lease additional spectrum only if the 
spectrum obtained through sensing is below a threshold. 

Observation 5: Each user i obtains the same SNR inde- 
pendent of gi and a payoff linear in t^. 

Observation [5] shows that users obtains fair and pre- 
dictable resource allocation at the equilibrium. In fact, 
a user does not need to know anything about the total 
number and payoffs of other users in the system. It can 
simply predict its QoS if it knows the cost structure of the 
network (C s and C; Such property is highly desirable 
in practice. 

Finally, users achieve the same high SNR at the equi- 
librium. The SNR value is either e ^ +c '^ or G/{B^*a), 
both of which are larger than e 2 . This means that the 
approximation ratio ln(SNRi)/ln(l+SNRj) > ln(e 2 )/ln(l + 
e 2 ) « 94%. The ratio can even be close to one if the price 
7r is high. 

In Sections 13.11 and 13.41 we made the high SNR regime 
approximation and the uniform distribution assumption 
of a to obtain closed-form expressions. Next we show 
that relaxing both assumptions will not change any of 
the major insights. 

5. The analysis of the game, however, does not require the users to 
know Cs or G;. 
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4.1 Robustness of the Observations 

Theorem 4: Observations [l][5] still hold under the gen- 
eral SNR regime (as in (QJ) and any general distribution 
of a. 

Proof: We represent a user i's payoff function in the 
general SNR regime, 



Ui(n,Wi) =Wi1n I 1 + 



TTW, 



(15) 



The optimal demand w*(tt) that maximizes dTBl l is 
w i( n ) = ffi/Ql 71 "): where Q(tt) is the unique positive 
solution to F(tt, Q) := ln(l + Q) - - tt = 0. We find 
the inverse function of Q{tt) to be tt(Q) = \n(l+Q) — y^q- 
By applying the implicit function theorem, we can obtain 
the first-order derivative of function Q(tt) over tt as 

dF(n,Q)/dTr (1 + Q(tt)) 2 
Q{n) = - d F M )/dQ= W) ' { ) 

which is always positive. Hence, Q{tt) is increasing in tt. 
User i's optimal payoff is 



U t (TT,W*{TT)) 



Q(tT 



■Ml + Q(7T))-7r] 



(17) 



As a result, a user's optimal SNR equals gi/w*(Tr) = Q(tt) 
and is user-independent. The total demand from all users 
equals G/Q(tt), and the operator's investment and pric- 
ing problem is 

( G 

R* = max £L=rn n [max max(min tt — , , 

B s >0 1 |L B,>0 7r>0 \ Q(tt) 



SsCs - BiCi)}. 



tt(Bi + S s a) 

(18) 



it* R 



Define R 

( fl8t is equivalent to solving 



and B s 



-|f . Then solving 



i?* = max -E Qe ro ii [maxmax(min ( , 7r(B; + i? s a) 



B S C S — Bid)]. 



(19) 



In Problem (19) , it is clear that the operator's optimal 
decisions on leasing, sensing and pricing do not depend 
on users' aggregate wireless characteristics. This is true 
for any continuous distribution of a. And a user's 
optimal payoff in eq. l fl7t is linear in gi since Q(tt) 
is independent of users' wireless characteristics. This 
shows that Observations [TJ |2 and [5] hold for the general 
SNR regime and any general distribution of a. We can 
also show that Observations [3] and [4] hold in the general 
case, with a detailed proof in Appendix [C] □ 

5 The Impact of Spectrum Sensing Un- 
certainty 

The key difference between our model and most existing 
literature (e.g., ED, IS], 122, ED, 123, E3) is the 
possibility of obtaining resource through the cheaper but 
uncertain approach of spectrum sensing. Here we will 
elaborate the impact of sensing on the performances of 
operator and users by comparing with the baseline case 



where sensing is not possible. Note that in the high 
sensing cost regime it is optimal not to sense, as a result, 
the performance of the operator and users will be the 
same as the baseline case. Hence we will focus on the 
low sensing cost regime in Table |5] 

Observation 6: The operator's optimal expected profit 
always benefits from the availability of spectrum sensing 
in the low sensing cost regime. 

Figure illustrates the normalized optimal expected 
profit as a function of the sensing cost. We assume 
leasing cost C; = 2, and thus the low sensing cost regime 
corresponds to the case where C s G [0.2, 1] in the figure. 
It is clear that sensing achieves a better optimal expected 
profit in this regime. In fact, sensing leads to 250% 
increase in profit when C s = 0.2. The benefit decreases as 
the sensing cost becomes higher. When sensing becomes 
too expensive, the operator will choose not to sense and 
thus achieve the same profit as in the baseline case. 

Theorem 5: The operator's realized profit (i.e., the 
profit for a given a) is a strictly increasing function in a 
in the low sensing cost regime. Furthermore, there exists 
a threshold a t h £ (0, 1) such that the operator's realized 
profit is larger than the baseline approach if a > ath- 

Proof: As in Table |5j we have two cases in the low 
sensing cost regime: 

. If a < Ge~ {2+Cl) /B^*, then substituting B^* into 
Rfj S1 (B s ,a) in Table [3] leads to the realized profit 



B^Cs + B^*aC l 



which is strictly and linearly increasing in a. 
If a > Ge~ (2+C,) /B%*, then substituting B%* into 
(B s ,a) in Table |3] leads to the realized profit 



R CS2 



R?P(a) = B 



In 



G 



Bi" 



a 



- 1 - sra 



Because the first-order derivative 



dRff 2 (c 
da 



Bi 



In 



( G 



\Bi* 



- 2 > 0, 



as Bg < Ge , Rjj (a) is strictly increasing in a. 

We can also verify that R ( ff 1 (a) = Rff 2 (a) when 
a = Ge~( 2+Cl } I Bg* . Therefore, the realized profit is a 
continuous and strictly increasing function of a. 

Next we prove the existence of threshold a t h- First 
consider the extreme case a — 0. Since the operator ob- 
tains no bandwidth through sensing but still incurs some 
cost, the profit in this case is lower than the baseline 
case. Furthermore, we can verify that Rff 2 (l) > R^ in 
Table |5j thus the realized profit at a = 1 is always larger 
than the baseline case. Together with the continuity and 
strictly increasing nature of the realized profit function, 
we have proven the existence of threshold of a t h- □ 

Figure [10] shows the realized profit as a function of a 
for different costs. The realized profit is increasing in a in 
both cases. The "crossing" feature of the two increasing 
curves is because the optimal sensing B* is larger under 
a cheaper sensing cost (C s = 0.5), which leads to larger 



11 




Fig. 9. Operator's normalized optimal Fig. 10. Operator's normalized opti- Fig. 11. User i's normalized optimal 
expected profit as a function of C s mal realized profit as a function of a. realized payoff as a function of a. 
and G\. 



realized profit loss (gain, respectively) when a — > 
(a — > 1, respectively). This shows the tradeoff between 
improvement of expected profit and the large variability 
of the realized profit. 

Theorem 6: Users always benefit from the availability 
of spectrum sensing in the low sensing cost regime. 

Proof: In the baseline approach without sensing, the 
operator always charges the price 1 + Cj. As shown in 
Table |5j the equilibrium price tt* with sensing is always 
no larger than 1 + Ci for any value of a. Since a user's 
payoff is strictly decreasing in price, the users always 
benefit from sensing. □ 

Figure [TT] shows how a user i's normalized realized 
payoff u*/gi changes with a. The payoff linearly in- 
creases in a when a becomes larger than a threshold, 
in which case the equilibrium price becomes lower than 
1+Cf. A smaller sensing cost C s leads to more aggressive 
sensing and thus more benefits to the users. 

6 Conclusions and Future Work 

This paper represents some initial results towards under- 
standing the new business models, opportunities, and 
challenges of the emerging cognitive virtual mobile net- 
work operators (C-MVNOs) under supply uncertainty. 
Here we focus on studying the trade-off between the 
cost and uncertainty of spectrum investment through 
sensing and leasing. We model the interactions between 
the operator and the users by a Stackelberg game, which 
captures the wireless heterogeneity of users in terms of 
maximum transmission power levels and channel gains. 

We have discovered several interesting features of the 
game equilibrium. We show that the operator's opti- 
mal sensing, leasing, and pricing decisions follow nice 
threshold structures. The availability of sensing always 
increases the operator's expected profit, despite that the 
realized profit in each time slot will have some variations 
depending on the sensing result. Moreover, users always 
benefit in terms of payoffs when sensing is performed 
by the operator. 

To keep the problem tractable, we have made several 
assumptions throughout this paper. Some assumptions 



can be (easily) generalized without affecting the main 
insights. 

« Imperfect sensing: we can incorporate imperfect spec- 
trum sensing (i.e., miss-detection and false-positive) 
into the model, which will change the uncertainty of 
the spectrum sensing. Given that our results work 
for any distribution of the sensing realization a, it is 
likely that such generalization does not change the 
major insights. 

• Learning: we can also consider the interactions of 
multiple time slots, where the sensing realizations 
of previous time slots can be used to update the 
distributions of a in future time slots. Again, the 
per slot decision model introduced in this paper is 
still applicable with a time-dependent a distribution 
input. 

Generalizations of some other assumptions, however, 
lead to more challenging new problems. 

• Incomplete information: when the operator does not 
know the information of the users, the system needs 
to be modeled as a dynamic game with incomplete 
information. More elaborate economic models such 
as screening and signaling [42] become relevant. 

• Time scale separation: it is possible that dynamic 
leasing is performed at a different (much larger) 
time scale compared with spectrum sensing. In that 
case, the operator has to make the leasing decision 
first, and then make several sequential sensing deci- 
sions. This leads to a dynamic decision model with 
more stages and tight couplings across sequential 
decisions. 

• Operator competition: There may be multiple C- 
MVNOs providing services in the same geographic 
area. In that case, the operators need to attract 
the users through price competition. Also, if they 
sense and lease from the same spectrum owner, 
the operators may have overlapping or conflicting 
resource requests. Although we have obtained some 
preliminary results along this line in (43|, more 
studies are definitely desirable. 

Through the analytical and simulation study of an 
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idealized model in this paper, we have obtained vari- 
ous interesting engineering and economical insights into 
the operations of C-MVNOs. We hope that this paper 
can contribute to the further understanding of proper 
network architecture decisions and business models of 
future cognitive radio systems. 

Appendix A 

Proof of Theorem QQ 

Given the total bandwidth Bi + B s a, the objective of 
Stage III is to solve the optimization problem ((8), i.e., 
maxTT^o min(Z?(7r), S(tt)). First, by examining the deriva- 
tive of D{tt), i.e., dD{ix)/dTX = (1 - Tr)Ge~ { - 1+ ' K \ we can 
see that the continuous function D(tt) is increasing in 
tt G [0,1] and decreasing in tt G [l,+oo], and D(ir) is 
maximized when n = 1. Since S(tt) always increases in 
tt and D(tt) is concave over tt G [0, 1], 5(7r) intersects 
with D(ir) if and only if 
Bi +B s a < Ge~ l . 

Next we divide our discussion into the intersection 
case and the non-intersection case: 

1) Given Bi +B s a < Ge _1 , S(tt) intersects with D(tt). 
By solving equation S(ir) = D(tt) the intersection 

point is 7r = In (^ Bi + B — 1- There are two sub- 
cases: 

• when B[ + B s a < Ge~~ 2 , S(tt) intersects with 
D(ir), and min(_D(7r), S(ir)) is maximized at the 



dD(ir) 



> » at tt = 0, i.e., 



In 



G 



Bi+B s a 



1. 



intersection point, i.e., tt 
(See 5 3 (vr) in Fig. [3J) 
• when Bi + B s a > Ge~~ 2 , S(tt) intersects with 
D(ir), and min(_D(7r), S(ir)) is maximized at the 
maximum value of D(tt), i.e., tt* = 1. (See S^tt) 
in Fig. |3) 

2) Given Bi + B s a > Ge -1 , S(tt) doesn't intersect with 
D(tt). Then min(_D(7r), S^tt)) is maximized at the 
maximum value of D(tt), i.e., tt* = 1. (See Si(ir) in 
Fig. EJ) ■ 

Appendix B 

Proof of Theorem [2] 

Given the sensing result B s a, the objective of Stage 
II is to solve the decomposed two subproblems (To} 
and (IT) , and select the best one with better optimal 
performance. Since Rffj(B s ,a, Bi) in subproblem dip) 
is linearly decreasing in Bi, its optimal solution always 
lies at the lower boundary of the feasible set (i.e., B^ = 
max{Ge~ 2 — B s a, 0}). We compare the optimal profits of 
two subproblems (i.e., Rff(B s ,a) and Rff(B s ,a)) for 
different sensing results: 

1) Given B s a > Ge~ 2 , the obtained bandwidth after 
Stage I is already in excessive supply regime. Thus 
it is optimal not to lease for subproblem iTTO) (i.e., 
Bf S3 = of case (ES3) in Table©. 

2) Given < B s a < Ge~ 2 , the optimal leasing deci- 
sion for subproblem (IT) is B\ = Ge~ 2 — B s a and 



we have Rfj S j(B Sl a, B{) = R ( ^ I s I {B s ,a, B{) when 
Bi = Ge~ 2 — B s a, thus the optimal objective value 
of (TO) is always no larger than that of (TT) and it is 
enough to consider the conservative supply regime 
only. Since 

d 2 R?f I (B s ,a,B l ) 



dBf 



1 

B t +B s a 



<0, 



Ryfj(B s ,a, Bi) is concave in < B l < Ger 2 - B s a. 
Thus it is enough to examine the first-order condi- 
tion 



dBi 



In 



G 



Bi + B s a 



C\ = 0, 



and the boundary condition < B\ < Ge 2 — B s a. 
This results in optimal leasing decision B* = 

max(Ge-( 2+Ci ) - B s a,0) and leads to Bf si = 
Ge ~(2+c l ) _ Bg0l and B cs2 = o of cases (CS1) and 

(CS2) in Table 13 
By substituting Bf si and Bf S2 into Rff I (B s ,a,Bi) 
in Table |2 we derive the corresponding optimal profits 
Rfj S1 (B s ,a) and Rfj S2 (B Sl a) in Table 1 Rff 3 {B s ,a) 
can also be obtained by substituting Bf S3 into 



Appendix C 

Supplementary Proof of Theorem [4] 

In this section, we prove that Observations |3] and UJ 
hold for the genera case (i.e., the general SNR regime 
and a general distributions of a). We first show that 
Observation |4] holds for the general case. 

C.1 Threshold structure of sensing 

It is not difficult to show that if the sensing cost is 
much larger than the leasing cost, the operator has 
no incentive to sense but will directly lease. Thus the 
threshold structure on the sensing decision in Stage I still 
holds for the general case. We ignore the details due to 
space limitations. 

C.2 Threshold structure of leasing 

Next we show the threshold structure on leasing in Stage 
II also holds. Similar as in the proof of Theorem [TJ we 
define D(n) = ^Q^fy and 5(7r) = ir(B 8 a + B{). 

• We first show that D(tt) is increasing when tt £ 
[0,0.468] and decreasing when n G [0.468, +00). To 
see this, we take the first-order derivative of D(tt) 
over tt, 

2Q(tt) 2 + Q{tt) - (1 + Q{tt)) 2 ln(l + Q{tt)) 



D'M 



which is positive when Q(tt) g [0,2.163) and nega- 
tive when Q(tt) g [2.163, +00). Since eq. ( fT6t shows 
that Q(tt) is increasing in tt and tt(Q) |q=2.163= 
0.468, as a result D(n) is increasing in tt G [0,0.468] 
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0.468 



Fig. 12. Different intersection cases of S(tv) and D(7r) in 
the general SNR regime. 



DXB/G) 



C,G 




Fig. 13. The relation between the normalized total band- 
width B/G and the derivative of the revenue D'(B/G). 



and decreasing in 7r e [0.468, +oo). In other words, 
D(ir) is maximized at ir = 0.468. 
Next we derive the operator's optimal pricing de- 
cision in Stage III. Figure [12] shows two possible 
intersection cases of S(tt) and D(tt). B t hi is defined 
as the total bandwidth obtained in Stages I and II 
(i.e., B s a + Bi) such that S(tt) intersects with D(ir) 
at 7r = 0.468. Here is how the optimal pricing is 
determined: 

- If B s a+Bi > B th i (e.g., S^n) in Fig.E), the op- 
timal price is ir* = 0.468. The total supply is no 
smaller (and often exceeds) the total demand. 

- If B s a + B l < B m (e.g., S 2 (ir) in Fig. E), the 
optimal price occurs at the unique intersection 
point of £(71") and D(ir) (where D(tt) has a 
negative first-order derivative). The total supply 
equals total demand. 

Now we are ready to show the threshold structure 
of the leasing decision. 

- If the sensing result from Stage I satisfies B s a > 
Bthi, then the operator will not lease. This 
is because leasing will only increase the total 
cost without increasing the revenue, since the 
optimal price is fixed at it* = 0.468 and thus 
revenue is also fixed at D(ir*). 

- Let us focus on the case where the sensing result 
from Stage I satisfies B s a < B t h\- Let us define 
B = B s a + Bi, then we have B = G/Q(ir) and 
7T = ln(l + G/B) - G/{G + B). This enables us 



to rewrite D(ir) as a function of total resource 
B only, 

G 



D(B) = B 



The first-order derivative of D(B) is 
D'(B) = In (l 



B/G J l + B/G {l + B/G) 2 ' 

(20) 

which denotes the increase of revenue D(B) due 
to unit increase in bandwidth B. Since obtaining 
each unit bandwidth has a cost of C; in Stage II, 
the operator will only lease positive amount of 
bandwidth if and only if D'(B s a) > Cj. To facil- 
itate the discussions, we will plot the function 
of D'(B/G) in Fig. [13] with the understanding 
that D'(B/G) = D'{B)G. The intersection point 
of B/G = 0.462 in Fig. [13] corresponds to the 
point of 7T = 0.468 in Fig. [32] The positive part 
of D'(B) on the left side of B/G = 0.462 in 
Fig. [131 corresponds to the part of D(ir) with a 
negative first-order derivative in Fig. [12] For any 
value Ci, Fig.[l3]shows that there exists a unique 
threshold B th2 (d) such that D> {B^d) / G) = 
QG, i.e., D'(B th2 (Ci)) = Q. Then the optimal 
leasing amount will be B t h 2 (Gi) — B s a if the 
bandwidth obtained from sensing B s a is less 
than Bth 2 iGi), otherwise it will be zero. 

C.3 Threshold structure of pricing and Observation 
® 

Based on the proofs above, we show that Observation [3] 
also holds for the general case as follows. Let us denote 
the optimal sensing decision as B*, and consider two 
sensing realizations a.\ and a 2 in time slots 1 and 2, 
respectively. Without loss of generality, we assume that 

a l < Q; 2- 

• If B*a 2 > Bthi, then the optimal price in time slot 2 
is 7r* = 0.468 (see Fig. [12]|. The optimal price in time 
slot 1 is always no smaller than 0.468. 

• If B*a± < B*a 2 < B t hi, then we need to consider 
three subcases: 

- If B*ai < B*a 2 < B t hi{Gi), then the operator 
will lease up to the threshold in both time slots, 
i.e., B* = B t h2{Ci) - B*ai in time slot 1 and 
B* = B t h 2 (Ci) - B*a 2 in time slot 2. Then 
optimal prices in both time slots are the same. 

- If B*a\ < Bth 2 (Ci) < B*a 2 , then the operator 
will lease _B ; * = B t h 2 (Ci) — B*ai in time slot 1 
and will not lease in time slot 2. Thus the total 
bandwidth in time slot 1 is smaller than that of 
time slot 2, and the optimal price in time slot 1 
is larger. 

- If B t h 2 {Ci) < B*ai < B*a 2 , then the operator 
in both time slots will not lease and total band- 
width in time slot 1 is smaller, and the optimal 
price in time slot 1 is larger. 
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To summarize, the optimal price ir* in Stage III is 
non-increasing in a. And the operator will charge a 
constant price (it* = 0.468) to the users as long as 
the total bandwidth obtained through sensing and 
leasing does not exceed the threshold B th2 (Ci). ■ 
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